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This application claims priority to co-pending U.S. provisional application 
5 serial number 60/268,584, filed February 14, 2001. 

FIELD OF THE INVENTION 

This invention relates to the field of biotechnology or genetic engineering. 
10 Specifically, this invention relates to the field of gene expression. More 
specifically, this invention relates to methods to reduce or eliminate 
transcriptional interference between two or more tandemly arranged genes 
within a host cell. 

15 BACKGROUND OF THE INVENTION 



gi " Plant biologists routinely express transgenes in plant cells. Very often, 

21 two or more genes are cloned next to each other as tandemly arranged genes on a 

M plasmid before the plasmid is introduced into plant cells. The genes of interest 

!S| 2 0 are cloned between the promoter and terminator/polyadenylation [poly(A)] 

rt* sequences to create gene expression cassettes. The orientation of the cassettes is 

usually determined by the availability of convenient restriction enzyme sites on 

the plasmid. 

Transcriptional interference refers to the phenomenon of reduced 

2 5 expression of a gene caused by the transcription from an upstream or 

downstream promoter and has been well documented in mammalian, yeast and 
prokaryotic systems (5, 6, 13). It can occur when different genes or gene 
expression cassettes are arranged in tandem, or even when these genes or 
cassettes are separated by several kilobases (5). The terminators in these gene 

3 0 constructs are not sufficient to prevent transcriptional interference between two 

adjacent genes. Possible mechanisms for reduction of transgene expression by 
an adjacent promoter include antisense RNA production that can co-suppress the 
endogenous copy of the transgene (7, 10), RNA polymerase "collision" (11), 
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localized depletion of diffusible transcription factors, and DNA topological 
constraints (16). 

Earlier studies in transgenic plants have shown that placing a promoter at 
the 3' end of a gene expression cassette and in opposite orientation reduces the 
5 expression of the upstream gene (8, 11). A poly(A) signal sequence placed on 
either side of the transgene blocked the downstream transcriptional read- 
through and restored the upstream gene expression (8). Configuration of 
transfer DNA (T-DNA) and whether the genes are placed at the left or right 
border of the T-DNA of the binary plant transformation vector (position effect) 

10 were also shown to have effects on transgene expression in plants (2, 3, 12). 

Tetracycline-dependent activation of an upstream promoter was shown to reduce 
the downstream gene expression in tomato (16). 

Even though the phenomenon of transcriptional interference is well 
known, the mechanism is poorly understood. Therefore, a need exists to provide 

15 methods to reduce transcriptional interference between tandemly arranged 
genes or expression cassettes to allow efficient or improved expression of these 
adjacent genes or cassettes. Applicant has now developed novel methods to 
overcome this problem and show herein that their novel methods effectively 
reduce or eliminate transcriptional interference between tandem genes. 

2 0 Various publications are cited herein, the disclosures of which are 

incorporated by reference in their entireties. The citation of any reference herein 
should not be construed as an admission that such reference is available as 
"Prior Art" to the instant application. 

2 5 SUMMARY OF THE INVENTION 

The present invention relates to methods to reduce or eliminate 
transcriptional interference between two or more tandemly arranged genes or 
gene expression cassettes in a host cell. 

3 0 Specifically, the present invention relates to a method to reduce or 

eliminate transcriptional interference between two tandemly arranged genes or 
gene expression cassettes in a host cell comprising introducing into the host cell 
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a polynucleotide comprising a first gene expression cassette encoding a first 
polypeptide, and a second gene expression cassette encoding a second 
polypeptide, whereby the first gene expression cassette and the second gene 
expression cassette are positioned in a tandem orientation; and eulturing the 
5 host cell under conditions, whereby transcriptional interference between the first 
gene expression cassette and the second gene expression cassette is reduced or 
eliminated and the first polypeptide and the second polypeptide are expressed. 
In a specific embodiment, the first gene expression cassette and the second gene 
expression cassette are positioned in a head (5')-to-head (5') orientation relative 

1 0 to each other. 

The present invention also relates to a method to reduce or eliminate 
transcriptional interference between two tandemly arranged genes or gene 
expression cassettes in a host cell comprising introducing into the host cell a 
polynucleotide comprising a) a first gene expression cassette encoding a first 

15 polypeptide, b) a spacer polynucleotide, and c) a second gene expression cassette 
encoding a second polypeptide, whereby the first gene expression cassette and 
the second gene expression cassette are positioned in a tandem orientation and 
the spacer polynucleotide of b) is positioned between the first expression cassette 
and the second expression cassette; and eulturing the host cell under conditions, 

2 0 whereby transcriptional interference between the first gene expression cassette 

and the second gene expression cassette is reduced or eliminated and the first 
polypeptide and the second polypeptide are expressed. 

In a specific embodiment, the first gene expression cassette and the second 
gene expression cassette are positioned in a tail (3')-to-tail (3') orientation 
25 relative to each other. In another specific embodiment, the first gene expression 
cassette and the second gene expression cassette are positioned in a head (5')-to- 
tail (3') orientation relative to each other. 

In a preferred embodiment, the spacer polynucleotide of b) comprises a 
nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 [154bp 

3 0 Transcription Blocker (TB) sequence], SEQ ID NO: 2 (702 bp BstEII lambda 

fragment), SEQ ID NO: 3 (1519 bp StuI lambda fragment), and SEQ ID NO: 4 
(2322 bp Hindlll lambda fragment). 



In another preferred embodiment, the spacer polynucleotide of b) 
comprises at least a 40% adenine and thiamine nucleotide content. Preferably, 
the spacer polynucleotide of b) comprises an adenine and thiamine nucleotide 
content of at least 46%, 48%, or 63%. More preferably, the spacer polynucleotide 
5 of b) comprises a polynucleotide sequence selected from the group consisting of 
SEQ ID NO: 2 (702 bp BstEII lambda fragment), SEQ ID NO: 3 (1519 bp StuI 
lambda fragment), and SEQ ID NO: 4 (2322 bp Hindlll lambda fragment). 

The present invention also relates to a transformed host cell produced as a 
result of a method of the invention. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

Q Figure 1. Schematic representation of gene constructs. (A) Luciferase (LUC), 

?1 green fluorescent protein (GFP) and the Renilla luciferase (RLUC) genes for 

=P I 5 plant expression are shown. Coding sequences for LUC, GFP, and RLUC are 
g flanked on the 5' end by the cassava vein mosaic virus promoter (CsVMV), the 

cauliflower mosaic virus promoter (35S), or the cauliflower mosaic virus 
duplicated promoter (E35S) and on the 3' end by the 35S terminator (35St) or 
NOS terminator (NOSt). GFP gene in addition contained tobacco etch virus 
2 0 (TEV) enhancer sequence. All genes were cloned in pBluescriptll SK \ (B) 

Plasmid constructs containing GFP and LUC genes head-to-tail (-> —»), head-to- 
head (< >), and tail-to-tail (—> <-). (C) Transcription blocker sequence (TB) or a 

X DNA fragment is placed between the GFP and LUC genes. (D) Plasmid used 
for chemical induction of LUC gene. GVE is a chimeric protein consisting of 
25 GAL4 DNA binding, VP 16 activation, and ecdysone receptor ligand binding 
domains. LUC coding sequence is placed behind the response element (where 
GVE can bind) consisting of 5 copies of GAL4 binding site (5xGAL) and minimal 
35S promoter (M35S). 
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Figure 2. Reduced expression of LUC gene due to the adjacent GFP gene. 
Protoplasts were transfected with the constructs shown on the left along with the 
RLUC plasmid as an internal control. LUC and BLUC enzyme activities were 
measured in relative light units as described in Example 1.3. LUC expression 
5 levels are shown as ratios of LUC/RLUC activities. The error bars represent 
standard deviations based on 5 to 7 independent transfections. 
Figure 3. Activation of an upstream promoter using a chemically inducible 
system reduced the expression of the constitutive downstream gene. Protoplasts 
transfected with the plasmid shown in Figure ID were divided into two parts, 
1° and one part received no chemical (Uninduced) and other part received 10 uM 
methoxyfenozide (Induced). LUC and RLUC activities are shown in relative 
light units (RLU). The error bars represent standard deviations based on 3 
S« independent transfections. 

Q- Figure 4. Transcriptional interference is eliminated by cloning transcription 

m 15 blocker or k DNA sequence between the genes. The transcription blocker 

sequence (TB), a 702, 1,519, or 2322 bp X DNA fragments were cloned between 
M GFP and LUC gene constructs shown on the left. Plasmid constructs were 

introduced into protoplasts along with the RLUC plasmid (as an internal control) 
fM and LUC and RLUC activities were assayed. LUC activities are shown as ratio of 

yj 20 LUC/RLUC activities. The error bars represent standard deviations based on 5 
•** to 7 independent transfections. 

Figure 5. Generic configuration of Expresslt™ inducible gene expression 

system. 

Figure 6. Inducible gene expression of GVE-Luc tail-to-tail construct in 
25 Arabidopsis TO plants. Luc and RLuc activities were assayed. Luc activities are 
shown as ratio of Luc/RLuc activities. 

Figure 7. Inducible gene expression of GVE-Luc tail-to-tail construct in tobacco 
TO plants. Luc and RLuc activities were assayed. Luc activities are shown as 
ratio of Luc/RLuc activities. 
3 0 Figure 8. Inducible gene expression of GVE-Luc head- to-tail construct in 

Arabidopsis TO plants. Luc and RLuc activities were assayed. Luc activities are 
shown as ratio of Luc/RLuc activities. 



Figure 9. Inducible gene expression of GVE-Luc head-to-tail construct in 
Arabidopsis Tl plants. Luc and KLuc activities were assayed. Luc activities are 
shown as ratio of Luc/RLuc activities. 

Figure 10. Inducible gene expression of GVE-Luc head- to-tail construct in 
5 tobacco TO plants. Luc and RLuc activities were assayed. Luc activities are 
shown as ratio of Luc/RLuc 
activities. 

Figure 11. Inducible gene expression of GVE-Luc head-to-head construct in 
Arabidopsis TO plants. Luc and RLuc activities were assayed. Luc activities are 
1° shown as ratio of Luc/RLuc activities. 

Figure 12. Inducible gene expression of GVE-Luc head-to-head construct in 
Arabidopsis Tl plants. Luc and RLuc activities were assayed. Luc activities are 
shown as ratio of Luc/RLuc activities. 

Figure 13. Inducible gene expression of GVE-Luc head-to-head construct in 
15 tobacco TO plants. Luc and RLuc activities were assayed. Luc activities are 
shown as ratio of Luc/RLuc activities. 

DETAILED DESCRIPTION OF THE INVENTION 

2 o Very often plants are transformed with two or more gene expression 

cassettes cloned in tandem. However, transcriptional interference between the 
two or more tandemly arranged cassettes can occur, resulting in decreased 
polypeptide production from one or more of the cassettes. Applicant has now 
shown that this transcriptional interference can be reduced or eliminated by 
25 modifying the orientation of the gene expression cassettes and/or by placing a 
spacer polynucleotide sequence between the cassettes. The present invention 
provides novel methods to reduce or eliminate transcriptional interference 
between adjacent genes and can be used to improve expression of multiple gene 
expression constructs in both prokaryotic and eukaryotic host cells. Thus, 

3 0 Applicant's novel methods overcome the limitations of tandemly arranged gene 

expression. 



DEFINITIONS 

In this disclosure, a number of terms and abbreviations are used. The 
following definitions are provided and should be helpful in understanding the 
scope and practice of the present invention. 

5 In a specific embodiment, the term "about" or "approximately" means 

within 20%, preferably within 10%, more preferably within 5%, and even more 
preferably within 1% of a given value or range. 

A "nucleic acid" is a polymeric compound comprised of covalently linked 
subunits called nucleotides. Nucleic acid includes polyribonucleic acid (RNA) 

0 and polydeoxyribonucleic acid (DNA), both of which may be single-stranded or 
double -stranded. DNA includes but is not limited to cDNA, genomic DNA, 
plasmids DNA, synthetic DNA, and semi-synthetic DNA. DNA may be linear, 
circular, or supercoiled. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of 

5 ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or 
deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or 
deoxycytidine; "DNA molecules"), or any phosphoester anologs thereof, such as 
phosphorothioates and thioesters, in either single stranded form, or a double- 
stranded helix. Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices 

0 are possible. The term nucleic acid molecule, and in particular DNA or RNA 
molecule, refers only to the primary and secondary structure of the molecule, and 
does not limit it to any particular tertiary forms. Thus, this term includes 
double-stranded DNA found, inter alia, in linear or circular DNA molecules (e.g., 
restriction fragments), plasmids, and chromosomes. In discussing the structure 

5 of particular double-stranded DNA molecules, sequences may be described 

herein according to the normal convention of giving only the sequence in the 5' to 
3' direction along the non-transcribed strand of DNA (i.e., the strand having a 
sequence homologous to the mRNA). A "recombinant DNA molecule" is a DNA 
molecule that has undergone a molecular biological manipulation. 

0 As used herein, an "isolated nucleic acid fragment" is a polymer of RNA or 

DNA that is single- or double-stranded, optionally containing synthetic, non- 
natural or altered nucleotide bases. An isolated nucleic acid fragment in the 
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form of a polymer of DNA may be comprised of one or more segments of cDNA, 
genomic DNA or synthetic DNA. 

A "gene" refers to an assembly of nucleotides that encode a polypeptide, 
and includes cDNA and genomic DNA nucleic acids. "Gene" also refers to a 
5 nucleic acid fragment that expresses a specific protein, including regulatory 
sequences preceding (5' non-coding sequences) and following (3' non-coding 
sequences) the coding sequence. "Native gene" refers to a gene as found in 
nature with its own regulatory sequences. "Chimeric gene" refers to any gene 
that is not a native gene, comprising regulatory and/or coding sequences that are 
1° not found together in nature. Accordingly, a chimeric gene may comprise 
regulatory sequences and coding sequences that are derived from different 
sources, or regulatory sequences and coding sequences derived from the same 
C source, but arranged in a manner different than that found in nature. 

Q "Endogenous gene" refers to a native gene in its natural location in the genome 

!5 of an organism. A "foreign" gene or "heterologous" gene refers to a gene not 
normally found in the host organism, but that is introduced into the host 
organism by gene transfer. Foreign genes can comprise native genes inserted 
into a non-native organism, or chimeric genes. A "transgene" is a gene that has 
been introduced into the genome by a transformation procedure. 
20 "Heterologous" DNA refers to DNA not naturally located in the cell, or in a 

chromosomal site of the cell. Preferably, the heterologous DNA includes a gene 
foreign to the cell. 

The term "genome" includes chromosomal as well as mitochondrial, 
chloroplast and viral DNA or RNA. 
25 A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, 

such as a cDNA, genomic DNA, or RNA, when a single stranded form of the 
nucleic acid molecule can anneal to the other nucleic acid molecule under the 
appropriate conditions of temperature and solution ionic strength (see Sambrook 
et al., 1989 infra). Hybridization and washing conditions are well known and 
3 0 exemplified in Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: 
A Laboratory Manual. Second Edition, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor (1989), particularly Chapter 11 and Table 11.1 therein 
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(entirely incorporated herein by reference). The conditions of temperature and 
ionic strength determine the "stringency" of the hybridization. 

Stringency conditions can be adjusted to screen for moderately similar 
fragments, such as homologous sequences from distantly related organisms, to 
5 highly similar fragments, such as genes that duplicate functional enzymes from 
closely related organisms. For preliminary screening for homologous nucleic 
acids, low stringency hybridization conditions, corresponding to a T m of 55°, can 
be used, e.g., 5x SSC, 0.1% SDS, 0.25% milk, and no formamide; or 30% 
formamide, 5x SSC, 0.5% SDS). Moderate stringency hybridization conditions 

10 correspond to a higher T m , e.g., 40% formamide, with 5x or 6x SCC. High 
stringency hybridization conditions correspond to the highest T m , e.g., 50% 
formamide, 5x or 6x SCC. Hybridization requires that the two nucleic acids 
contain complementary sequences, although depending on the stringency of the 
hybridization, mismatches between bases are possible. 

15 The term "complementary" is used to describe the relationship between 

nucleotide bases that are capable of hybridizing to one another. For example, 
with respect to DNA, adenosine is complementary to thymine and cytosine is 
complementary to guanine. Accordingly, the instant invention also includes 
isolated nucleic acid fragments that are complementary to the complete 

20 sequences as disclosed or used herein as well as those substantially similar 
nucleic acid sequences. 

In a specific embodiment, the term "standard hybridization conditions" 
refers to a T m of 55°C, and utilizes conditions as set forth above. In a preferred 
embodiment, the T m is 60°C; in a more preferred embodiment, the T m is 65°C. 

25 Post-hybridization washes also determine stringency conditions. One set 

of preferred conditions uses a series of washes starting with 6X SSC, 0.5% SDS 
at room temperature for 15 min, then repeated with 2X SSC, 0.5% SDS at 45°C 
for 30 min, and then repeated twice with 0.2X SSC, 0.5% SDS at 50°C for 
30 min. A more preferred set of stringent conditions uses higher temperatures in 

3 0 which the washes are identical to those above except for the temperature of the 
final two 30 min washes in 0.2X SSC, 0.5% SDS was increased to 60°C. Another 
preferred set of highly stringent conditions uses two final washes in 0.1X SSC, 
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0.1% SDS at 65°C. Hybridization requires that the two nucleic acids comprise 
complementary sequences, although depending on the stringency of the 
hybridization, mismatches between bases are possible. 

The appropriate stringency for hybridizing nucleic acids depends on the 
5 length of the nucleic acids and the degree of complementation, variables well 
known in the art. The greater the degree of similarity or homology between two 
nucleotide sequences, the greater the value of T m for hybrids of nucleic acids 
having those sequences. The relative stability (corresponding to higher T m ) of 
nucleic acid hybridizations decreases in the following order: RNA:RNA, 

10 DNA:RNA, DNA:DNA. For hybrids of greater than 100 nucleotides in length, 
equations for calculating T m have been derived (see Sambrook et al., supra, 9.50- 
0.51). For hybridization with shorter nucleic acids, i.e., oligonucleotides, the 
position of mismatches becomes more important, and the length of the 
oligonucleotide determines its specificity (see Sambrook et al., supra, 11.7-11.8). 

15 In one embodiment the length for a hybridizable nucleic acid is at least 

about 10 nucleotides. Preferable a minimum length for a hybridizable nucleic 
acid is at least about 15 nucleotides; more preferably at least about 
20 nucleotides; and most preferably the length is at least 30 nucleotides. 
Furthermore, the skilled artisan will recognize that the temperature and wash 

20 solution salt concentration may be adjusted as necessary according to factors 
such as length of the probe. 

The term "probe" refers to a single-stranded nucleic acid molecule that can 
base pair with a complementary single stranded target nucleic acid to form a 
double-stranded molecule. 

25 As used herein, the term "oligonucleotide" refers to a nucleic acid, 

generally of at least 18 nucleotides, that is hybridizable to a genomic DNA 
molecule, a cDNA molecule, a plasmid DNA or an mRNA molecule. 
Oligonucleotides can be labeled, e.g., with 32 P-nucleotides or nucleotides to which 
a label, such as biotin, has been covalently conjugated. A labeled oligonucleotide 

3 0 can be used as a probe to detect the presence of a nucleic acid. Oligonucleotides 
(one or both of which may be labeled) can be used as PCR primers, either for 
cloning full length or a fragment of a nucleic acid, or to detect the presence of a 
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nucleic acid. An oligonucleotide can also be used to form a triple helix with a 
DNA molecule. Generally, oligonucleotides are prepared synthetically, 
preferably on a nucleic acid synthesizer. Accordingly, oligonucleotides can be 
prepared with non-naturally occurring phosphoester analog bonds, such as 
5 thioester bonds, etc. 

A "primer" is an oligonucleotide that hybridizes to a target nucleic acid 
sequence to create a double stranded nucleic acid region that can serve as an 
initiation point for DNA synthesis under suitable conditions. Such primers may 
be used in a polymerase chain reaction. 

10 "Polymerase chain reaction" is abbreviated PCR and means an in vitro 

method for enzymatically amplifying specific nucleic acid sequences. PCR 
involves a repetitive series of temperature cycles with each cycle comprising 
three stages: denaturation of the template nucleic acid to separate the strands of 
the target molecule, annealing a single stranded PCR oligonucleotide primer to 

15 the template nucleic acid, and extension of the annealed primer(s) by DNA 

polymerase. PCR provides a means to detect the presence of the target molecule 
and, under quantitative or semi- quantitative conditions, to determine the 
relative amount of that target molecule within the starting pool of nucleic acids. 
"Reverse transcription-polymerase chain reaction" is abbreviated RT-PCR 

20 and means an in vitro method for enzymatically producing a target cDNA 
molecule or molecules from an RNA molecule or molecules, followed by 
enzymatic amplification of a specific nucleic acid sequence or sequences within 
the target cDNA molecule or molecules as described above. RT-PCR also provides 
a means to detect the presence of the target molecule and, under quantitative or 

2 5 semi-quantitative conditions, to determine the relative amount of that target 

molecule within the starting pool of nucleic acids. 

A DNA "coding sequence" is a double-stranded DNA sequence which is 
transcribed and translated into a polypeptide in a cell in vitro or in vivo when 
placed under the control of appropriate regulatory sequences. "Suitable 

3 0 regulatory sequences" refer to nucleotide sequences located upstream (5' non- 

coding sequences), within, or downstream (3' non-coding sequences) of a coding 
sequence, and which influence the transcription, RNA processing or stability, or 
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translation of the associated coding sequence. Regulatory sequences may include 
promoters, translation leader sequences, introns, polyadenylation recognition 
sequences, RNA processing site, effector binding site and stem-loop structure. 
The boundaries of the coding sequence are determined by a start codon at the 5' 
5 (amino) terminus and a translation stop codon at the 3' (carboxyl) terminus. A 
coding sequence can include, but is not limited to, prokaryotic sequences, cDNA 
from mRNA, genomic DNA sequences, and even synthetic DNA sequences. If the 
coding sequence is intended for expression in a eukaryotic cell, a polyadenylation 
signal and transcription termination sequence will usually be located 3' to the 
10 coding sequence. 

"Open reading frame" is abbreviated ORF and means a length of nucleic 
acid sequence, either DNA, cDNA or RNA, that comprises a translation start 
signal or initiation codon, such as an ATG or AUG, and a termination codon and 
can be potentially translated into a polypeptide sequence. 
15 "Promoter" refers to a DNA sequence capable of controlling the expression 

of a coding sequence or functional RNA. In general, a coding sequence is located 
1;' 3' to a promoter sequence. Promoters may be derived in their entirety from a 

g native gene, or be composed of different elements derived from different 

promoters found in nature, or even comprise synthetic DNA segments. It is 
2 0 understood by those skilled in the art that different promoters may direct the 
expression of a gene in different tissues or cell types, or at different stages of 
development, or in response to different environmental or physiological 
conditions. Promoters that cause a gene to be expressed in most cell types at 
most times are commonly referred to as "constitutive promoters". Promoters 
25 that cause a gene to be expressed in a specific cell type are commonly referred to 
as "cell-specific promoters" or "tissue -specific promoters". Promoters that cause a 
gene to be expressed at a specific stage of development or cell differentiation are 
commonly referred to as "develop mentally-specific promoters" or "cell 
differentiation-specific promoters". Promoters that are induced and cause a gene 
30 to be expressed following exposure or treatment of the cell with an agent, 

biological molecule, chemical, ligand, light, or the like that induces the promoter 
are commonly referred to as "inducible promoters" or "regulatable promoters". It 
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is further recognized that since in most cases the exact boundaries of regulatory 
sequences have not been completely defined, DNA fragments of different lengths 
may have identical promoter activity. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
5 polymerase in a cell and initiating transcription of a downstream (3' direction) 
coding sequence. For purposes of defining the present invention, the promoter 
sequence is bounded at its 3' terminus by the transcription initiation site and 
extends upstream (5' direction) to include the minimum number of bases or 
elements necessary to initiate transcription at levels detectable above 
10 background. Within the promoter sequence will be found a transcription 

initiation site (conveniently defined for example, by mapping with nuclease Si), 
M as well as protein binding domains (consensus sequences) responsible for the 

£ ■ ■ 

M binding of RNA polymerase. 

issi 

Hj A coding sequence is "under the control" of transcriptional and 

f* 15 translational control sequences in a cell when RNA polymerase transcribes the 

42 coding sequence into mRNA, which is then trans-RNA spliced (if the coding 

^ sequence contains introns) and translated into the protein encoded by the coding 

W sequence. 

M "Transcriptional and translational control sequences" are DNA regulatory 

20 sequences, such as promoters, enhancers, terminators, and the like, that provide 
f II for the expression of a coding sequence in a host cell. In eukaryotic cells, 

polyadenylation signals are control sequences. 

The term "operably linked" refers to the association of nucleic acid 
sequences on a single nucleic acid fragment so that the function of one is affected 
25 by the other. For example, a promoter is operably linked with a coding sequence 
when it is capable of affecting the expression of that coding sequence (i.e., that 
the coding sequence is under the transcriptional control of the promoter). Coding 
sequences can be operably linked to regulatory sequences in sense or antisense 
orientation. 

30 The term "expression", as used herein, refers to the transcription and 

stable accumulation of sense (mRNA) or antisense RNA derived from a nucleic 
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acid or polynucleotide. Expression may also refer to translation of mRNA into a 
polypeptide. 

The terms "cassette", "expression cassette" and "gene expression cassette" 
refer to a segment of DNA that can be inserted into a nucleic acid or 
5 polynucleotide at specific restriction sites or by homologous recombination. The 
segment of DNA comprises a polynucleotide that encodes a polypeptide of 
interest, and the cassette and restriction sites are designed to ensure insertion of 
the cassette in the proper reading frame for transcription and translation. 
"Transformation cassette" refers to a specific vector comprising a polynucleotide 

10 that encodes a polypeptide of interest and having elements in addition to the 
polynucleotide that facilitate transformation of a particular host cell. Cassettes, 
expression cassettes, gene expression cassettes and transformation cassettes of 
the invention may also comprise elements that allow for enhanced expression of 
a polynucleotide encoding a polypeptide of interest in a host cell. These elements 

15 may include, but are not limited to: a promoter, a minimal promoter, an 
enhancer, a response element, a terminator sequence, a polyadenylation 
sequence, and the like. 

The term "head-to-head" is used herein to describe the orientation of two 
nucleotide sequences, particularly two gene expression cassettes, in relation to 

20 each other. Two gene expression cassettes are positioned in a head-to-head 
orientation when the 5' end of the coding strand of one gene expression cassette 
is adjacent to the 5' end of the coding strand of the other gene expression 
cassette, whereby the direction of transcription of each gene expression cassette 
proceeds away from the 5' end of the other expression cassette. The term "head- 

25 to-head" may be abbreviated (5')-to-(5') and may also be indicated by the symbols 
(<- -») or (3'<-5'5'-»3'). 

The term "tail-to-tail" is used herein to describe the orientation of two 
nucleotide sequences, particularly two gene expression cassettes, in relation to 
each other. Two gene expression cassettes are positioned in a tail-to-tail 

3 0 orientation when the 3' end of the coding strand of one gene expression cassette 
is adjacent to the 3' end of the coding strand of the other gene expression 
cassette, whereby the direction of transcription of each gene expression cassette 
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proceeds toward the other expression cassette. The term "tail-to-tail" may be 
abbreviated (3')-to-(3') and may also be indicated by the symbols (-»■ <-) or 
(5'->3'3'<-5'). 

The term "head-to-tail" is used herein to describe the orientation of two 
5 nucleotide sequences, particularly two gene expression cassettes, in relation to 
each other. Two gene expression cassettes are positioned in a head-to-tail 
orientation when the 5' end of the coding strand of one gene expression cassette 
is adjacent to the 3' end of the coding strand of the other gene expression 
cassette, whereby the direction of transcription of each gene expression cassette 
10 proceeds in the same direction as that of the other expression cassette. The term 
"head-to-tail" may be abbreviated (5')-to-(3') and may also be indicated by the 
symbols (-> or (5'->3'5'->3'). 

The term "downstream" refers to a nucleotide sequence that is located 3' to 
reference nucleotide sequence. In particular, downstream nucleotide sequences 
15 generally relate to sequences that follow the starting point of transcription. For 
example, the translation initiation codon of a gene is located downstream of the 
start site of transcription. 
M The term "upstream" refers to a nucleotide sequence that is located 5' to 

^ reference nucleotide sequence. In particular, upstream nucleotide sequences 

20 generally relate to sequences that are located on the 5' side of a coding sequence 
fij or starting point of transcription. For example, most promoters are located 

upstream of the start site of transcription. 

The term "spacer polynucleotide" refers to a polynucleotide comprising a 
sequence of variable length and/or varying amounts of adenine and thiamine 
25 nucleotide content. The spacer polynucleotide may be inserted between two 
tandemly arranged gene expression cassettes according to the invention. The 
spacer polynucleotide may be heterologous DNA. In a specific embodiment, the 
spacer polynucleotide comprises at least 50 nucleotides or base pairs. Preferably, 
the spacer polynucleotide comprises at least 50, 75, 100, 125, 150, 250, 500, 750, 
3 0 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, or 3000 nucleotides or base 
pairs. More preferably, the spacer polynucleotide comprises 154, 702, 1519, or 
2322 nucleotides or base pairs. In another specific embodiment, the spacer 
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polynucleotide comprises at least 40% adenine and thiamine nucleotide content. 
Preferably, the spacer polynucleotide comprises at least 46% adenine and 
thiamine nucleotide content. More preferably, the spacer polynucleotide 
comprises at least 48% adenine and thiamine nucleotide content. Even more 
5 preferably, the spacer polynucleotide comprises at least 63% adenine and 
thiamine nucleotide content. 

The terms "restriction endonuclease" and "restriction enzyme" refer to an 
enzyme that binds and cuts within a specific nucleotide sequence within double 
stranded DNA. 

1 0 "Homologous recombination" refers to the insertion of a foreign DNA 

sequence into another DNA molecule, e.g., insertion of a vector in a chromosome. 
Preferably, the vector targets a specific chromosomal site for homologous 
recombination. For specific homologous recombination, the vector will contain 
sufficiently long regions of homology to sequences of the chromosome to allow 

15 complementary binding and incorporation of the vector into the chromosome. 
Longer regions of homology, and greater degrees of sequence similarity, may 
increase the efficiency of homologous recombination. 

A "vector" is any means for the cloning of and/or transfer of a nucleic acid 
into a host cell. A vector may be a replicon to which another DNA segment may 

20 be attached so as to bring about the replication of the attached segment. A 
"replicon" is any genetic element (e.g., plasmid, phage, cosmid, chromosome, 
virus) that functions as an autonomous unit of DNA replication in vivo, i.e., 
capable of replication under its own control. The term "vector" includes both 
viral and nonviral means for introducing the nucleic acid into a cell in vitro, ex 

2 5 v i V o or in vivo. Viral vectors include retrovirus, ade no-associated virus, pox, 

baculovirus, vaccinia, herpes simplex, Epstein-Barr, adenovirus, geminivirus, 
and caulimovirus vectors. Non-viral vectors include plasmids, liposomes, 
electrically charged lipids (cytofectins), DNA-protein complexes, and 
biopolymers. In addition to a nucleic acid, a vector may also comprise one or 

3 0 more regulatory regions, and/or selectable markers useful in selecting, 

measuring, and monitoring nucleic acid transfer results (transfer to which 
tissues, duration of expression, etc.). 
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The term "plasmid" refers to an extra chromosomal element often carrying 
a gene that is not part of the central metabolism of the cell, and usually in the 
form of circular double-stranded DNA molecules. Such elements may be 
autonomously replicating sequences, genome integrating sequences, phage or 
5 nucleotide sequences, linear, circular, or supercoiled, of a single- or double- 
stranded DNA or RNA, derived from any source, in which a number of nucleotide 
sequences have been joined or recombined into a unique construction which is 
capable of introducing a promoter fragment and DNA sequence for a selected 
gene product along with appropriate 3' untranslated sequence into a cell. 

10 A "cloning vector" is a "replicon", which is a unit length of DNA that 

replicates sequentially and which comprises an origin of replication, such as a 
plasmid, phage or cosmid, to which another DNA segment may be attached so as 
to bring about the replication of the attached segment. Cloning vectors may be 
capable of replication in one cell type and expression in another ("shuttle 

15 vector"). 

A cell has been "transfected" by exogenous or heterologous DNA when 
such DNA has been introduced inside the cell. A cell has been "transformed" by 
exogenous or heterologous DNA when the transfected DNA effects a phenotypic 
change. The transforming DNA can be integrated (covalently linked) into 

2 0 chromosomal DNA making up the genome of the cell. 

"Transformation" refers to the transfer of a nucleic acid fragment into the 
genome of a host organism, resulting in genetically stable inheritance. Host 
organisms containing the transformed nucleic acid fragments are referred to as 
"transgenic" or "recombinant" or "transformed" organisms. 
25 The term "genetic region" will refer to a region of a nucleic acid molecule 

or a nucleotide sequence that comprises a gene encoding a polypeptide. 

The term "selectable marker" means an identifying factor, usually an 
antibiotic or chemical resistance gene, that is able to be selected for based upon 
the marker gene's effect, i.e., resistance to an antibiotic, resistance to a herbicide, 

3 0 and the like, wherein the effect is used to track the inheritance of a nucleic acid 

of interest and/or to identify a cell or organism that has inherited the nucleic 
acid of interest. Examples of reporter genes known and used in the art include: 
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genes providing resistance to streptomycin, gentamycin, kanamycin, 
hygromycin, bialaphos herbicide, sulfonamide, and the like; and genes that are 
used as phenotypic markers, i.e., anthocyanin regulatory genes, isopentanyl 
transferase gene, and the like. 

5 The term "reporter gene" means a nucleic acid encoding an identifying 

factor that is able to be identified based upon the reporter gene's effect, wherein 
the effect is used to track the inheritance of a nucleic acid of interest, to identify 
a cell or organism that has inherited the nucleic acid of interest, and/or to 
measure gene expression induction or transcription. Examples of reporter genes 

0 known and used in the art include: luciferase (Luc), green fluorescent protein 
(GFP), chloramphenicol acetyltransferase (CAT), P-galactosidase (LacZ), (3- 
glucuronidase (Gus), and the like. 

The plasmids or vectors according to the invention may further comprise 
at least one promoter suitable for driving expression of a gene in a host cell. The 

5 term "expression vector" means a vector, plasmid or vehicle designed to enable 
the expression of an inserted nucleic acid sequence following transformation into 
the host. The cloned gene, i.e., the inserted nucleic acid sequence, is usually 
placed under the control of control elements such as a promoter, a minimal 
promoter, an enhancer, or the like. Initiation control regions or promoters, 

0 which are useful to drive expression of a nucleic acid in the desired host cell are 
numerous and familiar to those skilled in the art. Virtually any promoter 
capable of driving these genes is suitable for the present invention including but 
not limited to: viral promoters, plant promoters, bacterial promoters, animal 
promoters, mammalian promoters, synthetic promoters, constitutive promoters, 

5 tissue specific promoter, developmental specific promoters, inducible promoters, 
light regulated promoters; CYCl, HIS3, GAL1, GAL4, GAL 10, ADHl, PGK, 
PH05, GAPDH, ADC1, TRP1, URA3, LEU2, ENO, TPI, alkaline phosphatase 
promoters (useful for expression in Saccharomyces); AOX1 promoter (useful for 
expression in Pichia); b-lactamase, lac, ara, tet, trp, IPj^, IPr, T7, tac, and trc 

0 promoters (useful for expression in Escherichia coli); and light regulated-, seed 
specific-, pollen specific-, ovary specific-, pathogenesis or disease related-, 
cauliflower mosaic virus (CMV) 35S, CMV 35S minimal, cassava vein mosaic 
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virus (CsVMV), chlorophyll a/b binding protein, ribulose 1, 5-bisphosphate 
carboxylase, shoot-specific, root specific, chitinase, stress inducible, rice tungro 
bacilliform virus, plant super-promoter, potato leucine aminopeptidase, nitrate 
reductase, mannopine synthase, nopaline synthase, ubiquitin, zein protein, and 
5 anthocyanin promoters (useful for expression in plant cells); animal and 

mammalian promoters known in the art include, but are not limited to, the SV40 
early promoter region, the promoter contained in the 3' long terminal repeat 
(LTR) of Rous sarcoma virus (RSV), the cytomegalovirus (CMV) early promoter, 
the herpes thymidine kinase (TK) promoter, the regulatory sequences of the 

1 0 metallothionein gene, and transcriptional control regions, which exhibit tissue 
specificity and have been utilized in transgenic animals, such as the elastase I 
gene control region which is active in pancreatic acinar cells; insulin gene control 
region which is active in pancreatic beta cells, immunoglobulin gene control 
region which is active in lymphoid cells, mouse mammary tumor virus control 

15 region which is active in testicular, breast, lymphoid and mast cells, albumin 
gene control region which is active in liver, alpha-fetoprotein gene control region 
which is active in liver, alpha 1-antitrypsin gene control region which is active in 
the liver, beta-globin gene control region which is active in myeloid cells, myelin 
basic protein gene control region which is active in oligodendrocyte cells in the 

2 0 brain, myosin light chain-2 gene control region which is active in skeletal 

muscle, and gonadotropic releasing hormone gene control region which is active 
in the hypothalamus, and the like. In a preferred embodiment of the invention, 
the promoter is selected from the group consisting of a cauliflower mosaic virus 
35S promoter, a cassava vein mosaic virus promoter, and a cauliflower mosaic 

25 virus 35S minimal promoter. 

Enhancers that may be used in embodiments of the invention include but 
are not limited to: tobacco mosaic virus enhancer, cauliflower mosaic virus 35S 
enhancer, tobacco etch virus enhancer, ribulose 1, 5-bisphosphate carboxylase 
enhancer, rice tungro bacilliform virus enhancer, and other plant and viral gene 

30 enhancers, and the like. 

Termination control regions, i.e., terminator or polyadenylation sequences, 
may also be derived from various genes native to the preferred hosts. 
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Optionally, a termination site may be unnecessary, however, it is most preferred 
if included. In a preferred embodiment of the invention, the termination control 
region may be comprise or be derived from a synthetic sequence, nopaline 
synthase (nos), cauliflower mosaic virus (CaMV), octopine synthase (ocs), 
5 Agrobacterium, viral, and plant terminator sequences, or the like. 

The terms "3' non-coding sequences" or "3' untranslated region (UTR)" 
refer to DNA sequences located downstream (3') of a coding sequence and may 
comprise polyadenylation [poly(A)] recognition sequences and other sequences 
encoding regulatory signals capable of affecting mRNA processing or gene 

1 0 expression. The polyadenylation signal is usually characterized by affecting the 
addition of polyadenylic acid tracts to the 3' end of the mRNA precursor. 

"Regulatory region" means a nucleic acid sequence which regulates the 
expression of a second nucleic acid sequence. A regulatory region may include 
sequences which are naturally responsible for expressing a particular nucleic 

15 acid (a homologous region) or may include sequences of a different origin which 
are responsible for expressing different proteins or even synthetic proteins (a 
heterologous region). In particular, the sequences can be sequences of 
prokaryotic, eukaryotic, or viral genes or derived sequences which stimulate or 
repress transcription of a gene in a specific or non-specific manner and in an 

2 0 inducible or non- inducible manner. Regulatory regions include origins of 

replication, RNA splice sites, promoters, enhancers, transcriptional termination 
sequences, and signal sequences which direct the polypeptide into the secretory 
pathways of the target cell. 

A regulatory region from a "heterologous source" is a regulatory region 
25 which is not naturally associated with the expressed nucleic acid. Included 
among the heterologous regulatory regions are regulatory regions from a 
different species, regulatory regions from a different gene, hybrid regulatory 
sequences, and regulatory sequences which do not occur in nature, but which are 
designed by one having ordinary skill in the art. 

3 0 "RNA transcript" refers to the product resulting from RNA polymerase- 

catalyzed transcription of a DNA sequence. When the RNA transcript is a 
perfect complementary copy of the DNA sequence, it is referred to as the primary 
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transcript or it may be a RNA sequence derived from post-transcriptional 
processing of the primary transcript and is referred to as the mature RNA. 
"Messenger RNA (mRNA)" refers to the RNA that is without introns and that 
can be translated into protein by the cell. "cDNA" refers to a double-stranded 
5 DNA that is complementary to and derived from mRNA. "Sense" RNA refers to 
RNA transcript that includes the mRNA and so can be translated into protein by 
the cell. "Antisense RNA" refers to a RNA transcript that is complementary to 
all or part of a target primary transcript or mRNA and that blocks the 
expression of a target gene. The complementarity of an antisense RNA may be 
10 with any part of the specific gene transcript, i.e., at the 5' non-coding sequence, 
3' non-coding sequence, or the coding sequence. "Functional RNA" refers to 
antisense RNA, ribozyme RNA, or other RNA that is not translated yet has an 
effect on cellular processes. 

A "polypeptide" is a polymeric compound comprised of covalently linked 
15 amino acid residues. Amino acids have the following general structure: 

H 
I 

R-C-COOH 



S 20 NH 2 

ilj Amino acids are classified into seven groups on the basis of the side chain R: (1) 

aliphatic side chains, (2) side chains containing a hydroxylic (OH) group, (3) side 
chains containing sulfur atoms, (4) side chains containing an acidic or amide 
group, (5) side chains containing a basic group, (6) side chains containing an 
25 aromatic ring, and (7) proline, an imino acid in which the side chain is fused to 
the amino group. A polypeptide of the invention preferably comprises at least 
about 14 amino acids. 

A "protein" is a polypeptide that performs a structural or functional role in 
a living cell. 

30 A "variant" of a polypeptide or protein is any analogue, fragment, 

derivative, or mutant which is derived from a polypeptide or protein and which 
retains at least one biological property of the polypeptide or protein. Different 
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variants of the polypeptide or protein may exist in nature. These variants may 
be allelic variations characterized by differences in the nucleotide sequences of 
the structural gene coding for the protein, or may involve differential splicing or 
post-translational modification. The skilled artisan can produce variants having 
5 single or multiple amino acid substitutions, deletions, additions, or 

replacements. These variants may include, inter alia: (a) variants in which one 
or more amino acid residues are substituted with conservative or non- 
conservative amino acids, (b) variants in which one or more amino acids are 
added to the polypeptide or protein, (c) variants in which one or more of the 

1° amino acids includes a substituent group, and (d) variants in which the 

polypeptide or protein is fused with another polypeptide such as serum albumin. 
The techniques for obtaining these variants, including genetic (suppressions, 
deletions, mutations, etc.), chemical, and enzymatic techniques, are known to 
persons having ordinary skill in the art. A variant polypeptide preferably 

15 comprises at least about 14 amino acids. 

A "heterologous protein" refers to a protein not naturally produced in the 

cell. 

A "mature protein" refers to a post-translationally processed polypeptide; 

i.e., one from which any pre- or propeptides present in the primary translation 
20 product have been removed. "Precursor" protein refers to the primary product of 

translation of mRNA; i.e., with pre- and propeptides still present. Pre- and 

propeptides may be but are not limited to intracellular localization signals. 
The term "signal peptide" refers to an amino terminal polypeptide 

preceding the secreted mature protein. The signal peptide is cleaved from and is 
25 therefore not present in the mature protein. Signal peptides have the function of 

directing and translocating secreted proteins across cell membranes. Signal 

peptide is also referred to as signal protein. 

A "signal sequence" is included at the beginning of the coding sequence of 

a protein to be expressed on the surface of a cell. This sequence encodes a signal 
3 0 peptide, N-terminal to the mature polypeptide that directs the host cell to 

translocate the polypeptide. The term "translocation signal sequence" is used 

herein to refer to this sort of signal sequence. Translocation signal sequences 
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can be found associated with a variety of proteins native to eukaryotes and 
prokaryotes, and are often functional in both types of organisms. 

The term "homology" refers to the percent of identity between two 
polynucleotide or two polypeptide moieties. The correspondence between the 
5 sequence from one moiety to another can be determined by techniques known to 
the art. For example, homology can be determined by a direct comparison of the 
sequence information between two polypeptide molecules by aligning the 
sequence information and using readily available computer programs. 
Alternatively, homology can be determined by hybridization of polynucleotides 

10 under conditions which form stable duplexes between homologous regions, 
followed by digestion with single-stranded-specific nuclease(s) and size 
determination of the digested fragments. 

Accordingly, the term "sequence similarity" in all its grammatical forms 
refers to the degree of identity or correspondence between nucleic acid or amino 

15 acid sequences of proteins that may or may not share a common evolutionary 
origin (see Reeck et al., 1987, Cell 50:667). However, in common usage and in 
the instant application, the term "homologous," when modified with an adverb 
such as "highly," may refer to sequence similarity and not a common 
evolutionary origin. 

20 As used herein, the term "homologous" in all its grammatical forms and 

spelling variations refers to the relationship between proteins that possess a 
"common evolutionary origin," including proteins from superfamilies and 
homologous proteins from different species (Reeck et al., supra). Such proteins 
(and their encoding genes) have sequence homology, as reflected by their high 

2 5 degree of sequence similarity. 

In a specific embodiment, two DNA sequences are "substantially 
homologous" or "substantially similar" when at least about 50% (preferably at 
least about 75%, and most preferably at least about 90 or 95%) of the nucleotides 
match over the defined length of the DNA sequences. Sequences that are 

3 0 substantially homologous can be identified by comparing the sequences using 

standard software available in sequence data banks, or in a Southern 
hybridization experiment under, for example, stringent conditions as defined for 
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that particular system. Defining appropriate hybridization conditions is within 
the skill of the art. See, e.g., Sambrook et al., 1989, supra. 

The term "corresponding to" is used herein to refer to similar or 
homologous sequences, whether the exact position is identical or different from 
5 the molecule to which the similarity or homology is measured. A nucleic acid or 
amino acid sequence alignment may include spaces. Thus, the term 
"corresponding to" refers to the sequence similarity, and not the numbering of 
the amino acid residues or nucleotide bases. 

A "substantial portion" of an amino acid or nucleotide sequence comprises 

10 enough of the amino acid sequence of a polypeptide or the nucleotide sequence of 
a gene to putatively identify that polypeptide or gene, either by manual 
evaluation of the sequence by one skilled in the art, or by computer-automated 
sequence comparison and identification using algorithms such as BLAST (Basic 
Local Alignment Search Tool; Altschul, S. F., et al., (1993) J. Mol. Biol. 

15 215:403-410; see also www.ncbi.nlm.nih.gov/BLAST/). In general, a sequence of 
ten or more contiguous amino acids or thirty or more nucleotides is necessary in 
order to putatively identify a polypeptide or nucleic acid sequence as homologous 
to a known protein or gene. Moreover, with respect to nucleotide sequences, 
gene specific oligonucleotide probes comprising 20-30 contiguous nucleotides may 

20 be used in sequence-dependent methods of gene identification (e.g., Southern 
hybridization) and isolation (e.g., in situ hybridization of bacterial colonies or 
bacteriophage plaques). In addition, short oligonucleotides of 12-15 bases may 
be used as amplification primers in PCR in order to obtain a particular nucleic 
acid fragment comprising the primers. Accordingly, a "substantial portion" of a 

25 nucleotide sequence comprises enough of the sequence to specifically identify 
and/or isolate a nucleic acid fragment comprising the sequence. 

The term "percent identity", as known in the art, is a relationship between 
two or more polypeptide sequences or two or more polynucleotide sequences, as 
determined by comparing the sequences. In the art, "identity" also means the 

3 0 degree of sequence relatedness between polypeptide or polynucleotide sequences, 
as the case may be, as determined by the match between strings of such 
sequences. "Identity" and "similarity" can be readily calculated by known 
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methods, including but not limited to those described in: Computational 
Molecular Biology (Lesk, A. M., ed.) Oxford University Press, New York (1988); 
Biocomputing: Informatics and Genome Projects (Smith, D. W., ed.) Academic 
Press, New York (1993); Computer Analysis of Sequence Data, Part I (Griffin, A. 
5 M., and Griffin, H. G., eds.) Humana Press, New Jersey (1994); Sequence 
Analysis in Molecular Biology (von Heinje, G., ed.) Academic Press (1987); and 
Sequence Analysis Primer (Gribskov, M. and Devereux, J., eds.) Stockton Press, 
New York (1991). Preferred methods to determine identity are designed to give 
the best match between the sequences tested. Methods to determine identity 

1° and similarity are codified in publicly available computer programs. Sequence 
alignments and percent identity calculations may be performed using the 
Megalign program of the LASERGENE bioinformatics computing suite 
(DNASTAR Inc., Madison, WI). Multiple alignment of the sequences was 
performed using the Clustal method of alignment (Higgins and Sharp (1989) 

15 CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, GAP 

LENGTH PENALTY=10). Default parameters for pairwise alignments using the 
Clustal method were KTUPLE 1, GAP PENALTY=3, WINDOW=5 and 
DIAGONALS SAVED=5. 

The term "sequence analysis software" refers to any computer algorithm 

20 or software program that is useful for the analysis of nucleotide or amino acid 
sequences. "Sequence analysis software" may be commercially available or 
independently developed. Typical sequence analysis software will include but is 
not limited to the GCG suite of programs (Wisconsin Package Version 9.0, 
Genetics Computer Group (GCG), Madison, WI), BLASTP, BLASTN, BLASTX 

25 (Altschul et al., J. Mol. Biol. 215:403-410 (1990), and DNASTAR (DNASTAR, 
Inc. 1228 S. Park St. Madison, WI 53715 USA). Within the context of this 
application it will be understood that where sequence analysis software is used 
for analysis, that the results of the analysis will be based on the "default values" 
of the program referenced, unless otherwise specified. As used herein "default 

3 0 values" will mean any set of values or parameters which originally load with the 
software when first initialized. 
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"Synthetic genes" can be assembled from oligonucleotide building blocks 
that are chemically synthesized using procedures known to those skilled in the 
art. These building blocks are ligated and annealed to form gene segments 
which are then enzymatically assembled to construct the entire gene. 
5 "Chemically synthesized", as related to a sequence of DNA, means that the 
component nucleotides were assembled in vitro. Manual chemical synthesis of 
DNA may be accomplished using well-established procedures, or automated 
chemical synthesis can be performed using one of a number of commercially 
available machines. Accordingly, the genes can be tailored for optimal gene 
10 expression based on optimization of nucleotide sequence to reflect the codon bias 
of the host cell. The skilled artisan appreciates the likelihood of successful gene 
expression if codon usage is biased towards those codons favored by the host. 
Determination of preferred codons can be based on a survey of genes derived 
from the host cell where sequence information is available. 

15 

Methods of the Invention 

Applicant's invention provides methods to express two or more genes of 
interest in a host cell. More specifically, this invention provides methods to 
reduce or eliminate transcriptional interference between two or more tandemly 
2 o arranged genes or gene expression cassettes in a host cell. The methods 



according to the invention may be used to express two or more expression 

few 

Hi cassettes to produce two or more desired polypeptides within a host cell. 

In a specific embodiment, the present invention relates to a method to 
reduce or eliminate transcriptional interference between two tandemly arranged 

25 genes or gene expression cassettes in a host cell comprising introducing into the 
host cell a polynucleotide comprising a first gene expression cassette encoding a 
first polypeptide, and a second gene expression cassette encoding a second 
polypeptide, whereby the first gene expression cassette and the second gene 
expression cassette are positioned in a tandem orientation; and culturing the 

3 0 host cell under conditions, whereby transcriptional interference between the first 
gene expression cassette and the second gene expression cassette is reduced or 
eliminated and the first polypeptide and the second polypeptide are expressed. 
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In a specific embodiment, the present invention relates to a method to 
reduce or ehminate transcriptional interference between two tandemly arranged 
genes or gene expression cassettes in a host cell comprising introducing into the 
host cell a polynucleotide comprising a first gene expression cassette encoding a 
5 first polypeptide, and a second gene expression cassette encoding a second 
polypeptide, whereby the first gene expression cassette and the second gene 
expression cassette are positioned in a head (5')-to-head (5') orientation relative 
to each other; and culturing the host cell under conditions, whereby 
transcriptional interference between the first gene expression cassette and the 

!0 second gene expression cassette is reduced or eliminated and the first 
polypeptide and the second polypeptide are expressed. 

In a specific embodiment, the present invention relates to a method to 
reduce or eliminate transcriptional interference between two tandemly arranged 
genes or gene expression cassettes in a host cell comprising introducing into the 

15 host cell a polynucleotide comprising a) a first gene expression cassette encoding 
a first polypeptide, b) a spacer polynucleotide, and c) a second gene expression 
cassette encoding a second polypeptide, whereby the first gene expression 
cassette and the second gene expression cassette are positioned in a tandem 
orientation and the spacer polynucleotide of b) is positioned between the first 

2 0 expression cassette and the second expression cassette; and culturing the host 

cell under conditions, whereby transcriptional interference between the first 
gene expression cassette and the second gene expression cassette is reduced or 
eliminated and the first polypeptide and the second polypeptide are expressed. 

In another specific embodiment, the present invention relates to a method 
25 to reduce or eliminate transcriptional interference between two or more 

tandemly arranged genes or gene expression cassettes in a host cell comprising 
introducing into the host cell a polynucleotide comprising a) a first gene 
expression cassette, b) a spacer polynucleotide, and c) a second gene expression 
cassette, whereby the first gene expression cassette and the second gene 

3 0 expression cassette are positioned in a tail (3')-to-tail (3') orientation relative to 

each other and the spacer polynucleotide of b) is positioned between the first 
expression cassette and the second expression cassette; and culturing the host 
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cell under conditions, whereby transcriptional interference between the first 
gene expression cassette and the second gene expression cassette is reduced or 
eliminated and the first polypeptide and the second polypeptide are expressed. 
In a preferred embodiment, the spacer polynucleotide of b) is a transcription 
5 blocker polynucleotide comprising a nucleic acid sequence of SEQ ID NO: 1. 

In another specific embodiment, the present invention relates to a method 
to reduce or eliminate transcriptional interference between two or more 
tandemly arranged genes or gene expression cassettes in a host cell comprising 
introducing into the host cell a polynucleotide comprising a) a first gene 

10 expression cassette, b) a spacer polynucleotide, and c) a second gene expression 
cassette, whereby the first gene expression cassette and the second gene 
expression cassette are positioned in a head (5')-to-tail (3') orientation relative to 
each other and the spacer polynucleotide of b) is positioned between the first 
expression cassette and the second expression cassette; and culturing the host 

15 cell under conditions, whereby transcriptional interference between the first 
gene expression cassette and the second gene expression cassette is reduced or 
eliminated and the first polypeptide and the second polypeptide are expressed. 
In a preferred embodiment, the spacer polynucleotide of b) is a transcription 
blocker polynucleotide comprising a nucleic acid sequence of SEQ ID NO: 1. 

20 l n another specific embodiment, the present invention relates to a method 

to reduce or eliminate transcriptional interference between two or more 
tandemly arranged genes or gene expression cassettes in a host cell comprising 
introducing into the host cell a polynucleotide comprising a) a first gene 
expression cassette, b) a spacer polynucleotide comprising at least a 40% adenine 

25 and thiamine nucleotide content, and c) a second gene expression cassette, 
whereby the first gene expression cassette and the second gene expression 
cassette are positioned in a head (5')-to-tail (3') orientation relative to each other 
and the spacer polynucleotide of b) is positioned between the first expression 
cassette and the second expression cassette; and culturing the host cell under 

3 0 conditions, whereby transcriptional interference between the first gene 
expression cassette and the second gene expression cassette is reduced or 
eliminated and the first polypeptide and the second polypeptide are expressed. 
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Preferably, the spacer polynucleotide of b) comprises at least a 46% 
adenine and thiamine nucleotide content. In a specific embodiment, the spacer 
polynucleotide of b) comprising at least a 46% adenine and thiamine nucleotide 
content comprises a polynucleotide sequence comprising SEQ ID NO: 2 (702 bp 
5 BstEII lambda fragment). 

More preferably, the spacer polynucleotide of b) comprises at least a 48% 
adenine and thiamine nucleotide content. In a specific embodiment, the spacer 
polynucleotide of b) comprising at least a 48% adenine and thiamine nucleotide 
content comprises a polynucleotide sequence comprising SEQ ID NO: 3 (1519 bp 
10 StuI lambda fragment). 

Even more preferably, the spacer polynucleotide of b) comprises at least a 
63% adenine and thiamine nucleotide content. In a specific embodiment, the 
spacer polynucleotide of b) comprising at least a 63% adenine and thiamine 
nucleotide content comprises a polynucleotide sequence comprising SEQ ID NO: 
15 4 (2322 bp Hindlll lambda fragment). 

In another specific embodiment, the present invention relates to a method 
to reduce or eliminate transcriptional interference between two or more 
tandemly arranged genes or gene expression cassettes in a host cell comprising 
||l introducing into the host cell a polynucleotide comprising a) a first gene 

2 0 expression cassette, b) a spacer polynucleotide, and c) a second gene expression 
cassette, whereby the first gene expression cassette and the second gene 
expression cassette are positioned in a head (5>to-tail (3') orientation relative to 
each other and the spacer polynucleotide of b) is positioned between the first 
expression cassette and the second expression cassette; and culturing the host 

2 5 cell under conditions, whereby transcriptional interference between the first 
gene expression cassette and the second gene expression cassette is reduced or 
eliminated and the first polypeptide and the second polypeptide are expressed. 
Preferably, the spacer polynucleotide of b) comprises a polynucleotide sequence 
comprising SEQ ID NO: 2 (702 bp BstEII lambda fragment). More preferably, 

3 0 the spacer polynucleotide of b) comprises a polynucleotide sequence comprising 
SEQ ID NO: 3 (1519 bp StuI lambda fragment). Even more preferably, the 
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spacer polynucleotide of b) comprises a polynucleotide sequence comprising SEQ 
ID NO: 4 (2322 bp Hindlll lambda fragment). 

Host Cells of the Invention 
5 As described above, the methods of the present invention may be used to 

express two or more gene expression cassettes in a host cell. The host cell may 
be a prokaryotic and eukaryotic host cell, and may be selected from the group 
consisting of bacterial, fungal, yeast, plant, animal, and mammalian host cells. 
Examples of bacterial, fungal and yeast host cells include but are not limited to: 
10 Aspergillus, Trichoderma, Saccharomyces, Pichia, Candida, Hansenula, 

Synechocystis, Synechococcus, Salmonella, Bacillus, Acinetobacter, Rhodococcus, 
Streptomyces, Escherichia, Pseudomonas, Methylomonas, Methylobacter, 

M 

'm. Alcaligenes, Synechocystis, Anabaena, Thiobacillus, Methanobacterium and 

W- Klebsiella. 

45 15 In a preferred embodiment, this invention provides methods to reduce or 

=2 eliminate transcriptional interference between two or more genes or gene 

*P expression cassettes in a eukaryotic host cell. In a more preferred embodiment, 

q this invention provides methods to reduce or eliminate transcriptional 

:^ interference between two or more genes or gene expression cassettes in a plant 

yj 2 0 host cell. The plant host cell may be either a monocot or dicot plant cell. In a 
;r: specific embodiment, the plant host cell is selected from the group consisting of 

an apple, Arabidopsis, bajra, banana, barley, beans, beet, blackgram, chickpea, 
chilies, cucumber, eggplant, favabean, maize, melon, millet, mungbean, oat, 
okra, Panicum, papaya, peanut, pea, pepper, pigeonpea, pineapple, Phaseolus, 
25 potato, pumpkin, rice, sorghum, soybean, squash, sugarcane, sugarbeet, 

sunflower, sweet potato, tea, tomato, tobacco, watermelon, and wheat host cell. 

The present invention also relates to a transformed host cell that is 
produced as a result of a method of the invention. Thus, the present invention 
relates to a transformed host cell comprising two or more gene expression 
3 0 cassettes resulting from Applicant's methods. Preferably, the transformed host 
cell is selected from the group consisting of an apple, Arabidopsis, bajra, banana, 
barley, beans, beet, blackgram, chickpea, chilies, cucumber, eggplant, favabean, 
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maize, melon, millet, mungbean, oat, okra, Panicum, papaya, peanut, pea, 
pepper, pigeonpea, pineapple, Phaseolus, potato, pumpkin, rice, sorghum, 
soybean, squash, sugarcane, sugarbeet, sunflower, sweet potato, tea, tomato, 
tobacco, watermelon, and wheat host cell. 

5 

Plant Cell Transformation and Gene Expression 

Gene expression in transgenic plant host cells or transgenic plants is 
useful for production of various desirable traits or gene products, including but 
not limited to: control of flowering, herbicide resistance, fungicide resistance, 

1° insecticide resistance, plant size or form, nutrient content, drought- tolerance, 
and the like; various pathway intermediates for the modulation of pathways 
already existing in the host for the synthesis of new products heretofore not 
possible using the host; examination and determination of ectopic gene 
expression and gene function, particularly as related to the fields of genomics 

15 and proteomics. Additionally, the gene products may be useful for conferring 
higher growth yields of the host or for enabling alternative growth mode to be 
utilized. 



Plant cell transformation is well known in the art (see Trigiano and Gray, 



M 2 nd edition, 2000, CRC press, New York; Gartland and Davey, 1995, Humana 

20 Press, Totowa, New Jersey; Maliga et al., 995, Cold Spring Harbor Lab Press, 
New York; and Sambrook et al., 1989, Cold Spring Harbor Lab Press, New York) 
and may be achieved by various methods including but not limited to: 
electroporation, Agrobacterium-medi&ted, particle bombardment, and the like. 
Expression of desired gene products involves growing the transformed host cells 
25 under conditions that induce expression of the transformed genes or gene 
expression cassettes. The cells are harvested and gene products are isolated 
according to protocols specific for the gene product. 

In a specific embodiment of the present invention, the transformed host 
cell is an apple, Arabidopsis, bajra, banana, barley, beans, beet, blackgram, 
3 0 chickpea, chilies, cucumber, eggplant, favabean, maize, melon, millet, 
mungbean, oat, okra, Panicum, papaya, peanut, pea, pepper, pigeonpea, 
pineapple, Phaseolus, potato, pumpkin, rice, sorghum, soybean, squash, 
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sugarcane, sugarbeet, sunflower, sweet potato, tea, tomato, tobacco, watermelon, 
or wheat cell. 

Genes of Interest 

Nucleic acid sequence information for a desired protein can be located in 
one of many public access databases, for example, GENBANK, EMBL, Swiss- 
Prot, and PIR, or in many biology related journal publications. Thus, those 
skilled in the art have access to nucleic acid sequence information for virtually 
all known genes. Such information can then be used to construct the desired 
constructs for the insertion of the gene of interest within the expression cassettes 
used in Applicant's methods described herein. 

In a specific embodiment of the invention, at least one of the gene 
expression cassettes comprises a homologous gene with respect to the host cell. 
In another specific embodiment of the invention, at least one of the gene 
expression cassettes comprises a heterologous gene with respect to the host cell. 
Examples of genes of interest for expression in a host cell using Applicant's 
methods include, but are not limited to: antigens produced in plants as vaccines, 
enzymes like alpha-amylase, phytase, glucanes, and xylanse, genes for 
resistance against insects, nematodes, fungi, bacteria, viruses, and abiotic 
stresses, nutraceuticals, pharmaceuticals, vitamins, genes for modifying amino 
acid content, herbicide resistance, cold, drought, and heat tolerance, industrial 
products, oils, protein, carbohydrates, antioxidants, male sterile plants, flowers, 
fuels, and other output traits, and the like. 

25 Measuring Gene Expression/Transcription 

One useful measurement in Applicant's methods of the invention is that of 
the transcriptional state of the cell which includes the identities and abundance 
of RNA, preferably mRNA species. Such measurements are conveniently 
conducted by measuring cDNA abundance by any of several existing gene 

3 0 expression technologies well known in the art. 
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Another biological pathway response measurement is by determining the 
translation state of the cell by measuring the abundances of the constituent 
protein species present in the cell using processes well known in the art. 

Where identification of genes associated with various physiological 
5 functions is desired, an assay may be employed in which changes in such 

functions as cell growth, apoptosis, senescence, differentiation, adhesion, binding 
to a specific molecules, binding to another cell, cellular organization, 
organogenesis, intracellular transport, transport facilitation, energy conversion, 
metabolism, myogenesis, mneurogenesis, and/or hematopoiesis is measured. 
10 Other methods to detect the products of gene expression are known 

including Southern blots (DNA detection), dot or slot blots (DNA, RNA), 
Northern blots (RNA), and RT-PCR (RNA). Although less preferred, labeled 
proteins can be used to detect a particular nucleic acid sequence to which it 
hybridizes. 

15 In some cases it is necessary to amplify the amount of a nucleic acid 

sequence. This may be carried out using one or more of a number of suitable 
methods including, for example, polymerase chain reaction ("PCR"), reverse- 
transcription polymerase chain reaction ("RT-PCR"), ligase chain reaction 
("LCR"), strand displacement amplification ("SDA"), transcription-based 

2 0 amplification, and the like. PCR is carried out in accordance with known 

techniques in which, for example, a nucleic acid sample is treated in the presence 
of a heat stable DNA polymerase, under hybridizing conditions, with one 
oligonucleotide primer for each strand of the specific sequence to be detected. An 
extension product of each primer which is synthesized is complementary to each 

25 of the two nucleic acid strands, with the primers sufficiently complementary to 
each strand of the specific sequence to hybridize therewith. The extension 
product synthesized from each primer can also serve as a template for further 
synthesis of extension products using the same primers. Following a sufficient 
number of rounds of synthesis of extension products, the sample is analyzed to 

30 assess whether the sequence or sequences to be detected are present. 

Nucleic acid array technology is a useful technique for determining 
differential mRNA expression. This type of analysis may be desirable when 
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analyzing gene function or downstream pathway effects of gene expression. 
Such technology includes, for example, oligonucleotide chips and DNA 
microarrays. These techniques rely on DNA fragments or oligonucleotides which 
correspond to different genes or cDNAs which are immobilized on a solid support 
5 and hybridized to probes prepared from total mRNA pools extracted from cells, 
tissues, or whole organisms and converted to cDNA. Oligonucleotide chips are 
arrays of oligonucleotides synthesized on a substrate using photolithographic 
techniques. Chips have been produced which can analyze up to 1700 genes. 
DNA microarrays are arrays of DNA samples, typically PCR products, that are 

!0 robotically printed onto a microscope slide. Each gene is analyzed by a full or 
partial-length target DNA sequence. Microarrays with up to 10,000 genes are 
now routinely prepared commercially. The primary difference between these two 
techniques is that oligonucleotide chips typically utilize 25-mer oligonucleotides 
which allow fractionation of short DNA molecules whereas the larger DNA 

15 targets of microarrays, approximately 1000 base pairs, may provide more 
sensitivity in fractionating complex DNA mixtures. 

The present invention may be better understood by reference to the 
following non-limiting Examples, which are provided as exemplary of the 
20 invention. 

EXAMPLE 1 



This Example describes a method for reducing or eliminating 
transcriptional interference between two tandemly arranged gene expression 
cassettes. Applicant has herein examined the influence of a gene on the 
expression of an adjacent gene. Briefly, two gene expression cassettes were 
cloned in all possible orientations to study the effect of orientation of one gene on 
the expression of the other gene. Applicant has now shown that transcriptional 
interference is orientation dependent and can be reduced or eliminated 
depending upon the arrangement of the gene expression cassettes in relation to 
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each other or by the insertion of spacer polynucleotides between the gene 
expression cassettes. 

GENERAL METHODS 
5 Standard recombinant DNA and molecular cloning techniques used herein 

are well known in the art and are described by Sambrook, J., Fritsch, E. F. and 
Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor 
Laboratory Press: Cold Spring Harbor, (1989) (Maniatis) and by T. J. Silhavy, 
M. L. Bennan, and L. W. Enquist, Experiments with Gene Fusions, Cold Spring 

10 Harbor Laboratory, Cold Spring Harbor, NY. (1984) and by Ausubel, F. M. et al., 
Current Protocols in Molecular Biology, pub. by Greene Publishing Assoc. and 
Wiley-Interscience (1987). 

Methods for plant tissue culture, transformation, plant molecular biology, 
and plant, general molecular biology may be found in "Plant Tissue Culture 

15 Concepts and Laboratory Exercises" edited by RN Trigiano and DJ Gray, 2 nd 
edition, 2000, CRC press, New York; "Agrobacterium Protocols" edited by KMA 
Gartland and MR Davey, 1995, Humana Press, Totowa, New Jersey; "Methods 
in Plant Molecular Biology" P. Maliga et al., 995, Cold Spring Harbor Lab Press, 
New York; and "Molecular Cloning" J. Sambrook et al., 1989, Cold Spring 

2 0 Harbor Lab Press, New York. 

Materials and methods suitable for the maintenance and growth of 
bacterial cultures are well known in the art. Techniques suitable for use in the 
following examples may be found as set out in Manual of Methods for General 
Bacteriology (Phillipp Gerhardt, R. G. E. Murray, Ralph N. Costilow, Eugene W. 
25 Nester, Willis A. Wood, Noel R. Krieg and G. Briggs Phillips, eds), American 
Society for Microbiology, Washington, DC. (1994) or by Thomas D. Brock in 
Biotechnology: A Textbook of Industrial Microbiology. Second Edition, Sinauer 
Associates, Inc., Sunderland, MA (1989). All reagents, restriction enzymes and 
materials used for the growth and maintenance of host cells were obtained from 

3 0 Aldrich Chemicals (Milwaukee, WI), DIFCO Laboratories (Detroit, MI), 

GIBCC7BRL (Gaithersburg, MD), or Sigma Chemical Company (St. Louis, MO) 
unless otherwise specified. 
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Manipulations of genetic sequences may be accomplished using the suite 
of programs available from the Genetics Computer Group Inc. (Wisconsin 
Package Version 9.0, Genetics Computer Group (GCG), Madison, WI). Where 
the GCG program "Pileup" is used the gap creation default value of 12, and the 
5 gap extension default value of 4 may be used. Where the CGC "Gap" or "Bestfit" 
program is used the default gap creation penalty of 50 and the default gap 
extension penalty of 3 may be used. In any case where GCG program 
parameters are not prompted for, in these or any other GCG program, default 
values may be used. 
10 The meaning of abbreviations is as follows: "h" means hour(s), "min" 

means minute (s), "sec" means second(s), "d" means day(s), "ul" means 
microliter (s), "ml" means milliliter(s), "L" means liter(s), "uM" means 
M micromolar, "mM" means millimolar, "og" means microgram(s), "mg" means 

S| milligram(s), "A" means adenine or adenosine, "T" means thymine or thymidine, 

\| 15 "G" means guanine or guanosine, "C" means cytidine or cytosine, "x g" means 
%i times gravity, "nt" means nucleotide(s), "aa" means amino acid(s), "bp" means 

base pair(s), "kb" means kilobase(s), "k" means kilo, "u" means micro, "%" means 
f percent, and "°C" means degrees Celsius. 



20 1.1 Plasmid Constructs 

Genes for firefly Photinus pyralis luciferase (LUC) and jellyfish Aequorea 
vactoria green fluorescent protein (GFP) suitable for expression in plant cells 
were constructed in pBlueScriptll SK~ using standard procedures (15). As 
illustrated in Figure 1A, the LUC gene construct contained the cassava vein 

25 mosaic virus (CsVMV) promoter (17), LUC coding sequence, and NOS terminator 
sequence. The GFP gene construct contained a duplicated cauliflower mosaic 
virus 35S promoter (E35S), tobacco etch virus (TEV) enhancer, GFP coding 
sequence, and 35S terminator sequence (14). As an internal control for 
protoplast assays, a Renilla Luciferase (RLUC) gene was constructed with a 35S 

3 0 promoter and NOS terminator. To obtain plasmids containing both genes in all 
possible orientations, the GFP gene was cloned before or after the LUC gene in 
four possible orientations (Figure IB). In addition, a 154 bp mammalian 
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transcription blocker (TB) sequence (5, SEQ ID NO: 1, GenBank accession 
number U89937), a 702 BstE II fragment (SEQ ID NO: 2), a 1519 Stu I fragment 
(SEQ ID NO: 3), or a 2322 bp Hind III fragment (SEQ ID NO: 4) from X phage 
DNA (GenBank accession number J02459) was cloned between the LUC and 
5 GFP gene cassettes (Figure 1C). An ecdysone receptor (EcR)-based, chemically 
inducible gene expression plasmid was constructed by cloning the chimeric GVE 
gene cassette (CsVMV promoter - GAL4 DNA binding domain:VP16 activation 
domain:EcR ligand binding domain - NOS terminator), inducible LUC gene 
cassette (5xGAL4 response element - minimal 35S promoter - LUC coding 
1° sequence - 35S terminator), and constitutive RLUC gene cassette (35S promoter - 
RLUC coding sequence - 35S terminator) in the orientation shown in Figure ID. 
All plasmid DNAs were prepared using Qiagen Plasmid Maxi kit. 

1.2 Protoplast Transfections 

Protoplasts were prepared from actively growing tobacco BY-2 suspension 
15 cells (Japan Tobacco, Japan) and cultured essentially as described in Kikkawa et 
al. (9). One ml of protoplasts (5x 10 5 to lx 10 6 cells) was mixed with plasmid 
DNA and electroporated at 0.56 K Volts for 80 ja seconds using an electroporation 
system with petripulser electrode (BTX). Equimolar plasmid DNA was added to 
transfections and the amount of DNA added varied from 23 to 30 jag depending 

2 0 on the plasmid size. To standardize the luciferase enzyme assays, 2 ug of the 

RLUC plasmid DNA were also added to each transfection as an internal control. 
Following the electroporation, protoplasts were diluted with 1 ml of 2x 
protoplast culture medium, aliquoted as two 1 ml cultures and incubated at 27°C 
for 16-17 h. Each plasmid construct was tested in 5 to 7 independent 
25 transfection experiments. 

1.3 Luciferase Enzyme Assay 

Protoplast cultures were lysed by adding 250 ul of 5x passive lysis buffer 
(Promega) and shaking at room temperature for 10 minutes. A 20 ul lysate from 
each culture (two assays per transfection) were assayed for luciferase (Luc) and 

3 0 Renilla luciferase (RLuc) enzyme activities using the Dual-Luciferase Reporter 

Assay Kit (Promega) according to the manufacturer's suggestions. The assay 
was performed with a luminometer plate reader (Dynex). The luciferase enzyme 
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activities were measured and expressed as ratios of luciferase to Renilla 
luciferase activities in relative light units. 
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1.4 Ecdysone Receptor-Based Gene Induction 

5 Ecdysone receptor (EcR)-based gene induction plasmid was constructed (as 

described in Example 1.1, see Figure ID) by cloning 1) a constitutively expressed 
receptor cassette comprising a CsVMV promoter, operably linked to a GVE 
(GAL4 DNA binding domain: VP 16 activation domain:EcR ligand binding 
domain) encoding polynucleotide and a NOS terminator polynucleotide and 2) an 

10 inducible reporter gene cassette comprising 5xGAL binding sites (response 
elements) operably linked to a minimal 35S promoter operably linked to a Luc 
encoding polynucleotide and a 35S terminator polynucleotide in a head-to-head 
orientation. A third expression cassette comprising a 35S promoter operably 
linked to a Renilla Luc encoding polynucleotide and a 35S terminator 

15 polynucleotide was then cloned downstream of the Luc gene expression cassette 
(see Figures ID and 3). Induction of the Luc gene expression cassette was 
controlled by the addition of EcR ligand methoxyfenozide. 



20 



1.5 RESULTS AND DISCUSSION 

Results are summarized in the following Tables 1 and 2. 
Table 1. 



Plasmid 


Luc/RLuc ratio 

Averages and (values) of 2 to 7 
transfections 


Relative 
to Luc— » 


Luc-» 


7.06 (7.16, 5.38, 6.99. 6.84, 8.63, 6.56, 
7.85) 


100 


GFP<- -^Luc 


10.05 (10.48, 10.59, 9.96, 10.62, 
10.76, 7.87) 


142 


Luc-> <-GFP 


3.31 (3.17, 3.14, 3.53, 3.58, 3.15) 


47 


GFP-> ->Luc 


1.41 (1.41, 1.39, 1.32, 1.40, 1.50, 1.43) 


20 
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Luc-> -»GFP 


7.38 (7.26, 6.95, 7.69, 7.56, 7.45) 


105 








(TB) Luc-> 
->GFP 


7.76 (8.20, 7.16, 8.67, 8.39, 6.38) 


110 


Luc-> (TB) 
<-GFP 


8.76 (10.33, 9.53, 9.55, 7.06, 7.37) 


123 


GFP^ (TB) 
— »Luc 


3.96 (3.82, 3.76, 3.09, 4.01, 4.26, 4.83) 


56 


GFP<- (TB) 
Luc— > 


11.73 (11.62, 13.24, 11.36, 11.17, 
13.3, 9.57) 


166 


GFP^ (2.3kb) 
— >Luc 


14.81 (14.68, 14.95) 


209 


GFP^ 


0.0003 (0.0003, 0.0004) 


0.04 



Table 2. 



Construct 


Luc 

Uninduced 


Luc 

Induced 


RLuc 

Uninduced 


RLuc 
Induced 


<— RLuc Luc<— 
->GVE 


1,902 


25,428 


30,588 


8,832 



fij To determine the transcriptional interference on the expression of one 

5 gene from its adjacent gene, two genes (promoter - coding sequence - terminator) 
were cloned in all four possible orientations (Figure IB). The promoters used for 
the expression of GFP and LUC in plant cells were cauliflower mosaic virus 35S 
(35S) and cassava vein mosaic virus (CsVMV) promoters. Both 35S and CsVMV 
promoters are commonly used to express transgenes in plant cells. Both the 

10 duplicated 35S promoter (E35S) and the CsVMV promoter showed similar 

promoter strength in tobacco BY-2 protoplasts in our previous experiments (data 
not shown). Specifically, the E35Sp — Luc — NOSt construct and CsVMVp — 
Luc — NOSt construct, when evaluated on separate plasmids have similar Luc 
activity showing that the two promoters used to express GFP and Luc in this 

15 study have similar strength (data not shown). 
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No endogenous LUC enzyme activity was detected in tobacco BY-2 
protoplasts. LUC enzyme activity was about 1 relative light unit (RLU) with a 
LUC/RLUC ratio of 0.0003 in protoplasts transfected with salmon sperm DNA or 
GFP gene (GFP ->). In contrast, protoplasts transfected with the LUC gene 
5 (LUC ->) showed LUC activities of about 50,000 RLUs with an average 

LUC/RLUC ratio of 7.07 (Figure 2). When the GFP gene was cloned upstream of 
the LUC gene in a head-to-tail orientation (GFP^ -»LUC), the LUC activity was 
reduced by —80%. However, no inhibitory interference was detected on LUC 
gene expression when the GFP gene was downstream of the LUC gene in the 

10 head-to-tail (LUC-> -»GFP) orientation (Figure 2). The LUC activity was 
reduced by 53% when GFP and LUC genes were in tail-to-tail (Luc-» -^GFP) 
orientation. There was no decrease in LUC gene expression when the 

orientation was head-to-head (GFP< >LUC). 

In order to test if the activation of an upstream promoter can cause 

15 transcriptional interference on downstream gene expression, an ecdysone 
receptor (EcR) based, chemically inducible gene expression system was used 
(unpublished). This system is similar to a glucocorticoid receptor based, 
dexamethasone-inducible gene expression system developed for plants by 
Aoyama and Chua (1). It contains a constitutively expressed chimeric GVE 

20 receptor gene (CsVMV promoter - sequences coding for GAL4 DNA binding 
domain, VP16 activation domain, and EcR ligand binding domain - NOS 
terminator) and an inducible promoter (five copies of GAL4 response element + 
minimal 35S promoter) to drive LUC gene expression (Figure ID). When the 
ligand methoxyfenozide (4) is applied to protoplasts, it binds to and activates the 

25 chimeric receptor. The activated receptor then induces LUC gene expression. 
This system was used to activate the upstream LUC gene, and the influence of 
LUC activation on the downstream constitutive RLUC gene expression was then 
measured. After the addition of a ligand to protoplasts, the LUC gene expression 
increased from 1,902 to 25,428 relative light units (RLU), while the RLUC 

3 0 expression decreased from 30,588 to 8832 RLU (Figure 3). Therefore, activation 
of the upstream Luc gene with ecdysone receptor- mediated inducible gene 
expression system reduced the expression of downstream, constitutively 
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expressed RLuc expression by 71%. This result reinforces the result obtained 
with the head-to-tail GFP-^ -»Luc construct. 

The above results show that the LUC gene expression was affected by the 
location and the orientation of adjacent GFP gene, and induction of the upstream 
promoter reduced the downstream gene expression. Neither the 35S nor the 
NOS terminator was sufficient in preventing the transcriptional interference 
between tandem genes. In an earlier study (8), the interference was blocked by 
placing a terminator (polyadenylation) sequence on either side of the transgene. 
In the construct GFP-» ->LUC (Figure 2), LUC activity was reduced 80% even 
though there were terminators before the LUC gene and after LUC coding 
sequence. It is possible that the strength of the promoter is important in 
determining the degree of transcriptional interference. The results described 
herein suggest that the transcriptional interference between the GFP and LUC 
genes was caused by RNA polymerase read- through and/or by antisense RNA 
production. The interference is probably not due to the local depletion of 
diffusible transcription factors or DNA topological constraints since the LUC 
gene expression was not reduced by the expression of GFP in all orientations. 
Applicant's results demonstrate that transcriptional interference can be 

avoided by cloning two genes in the head-to-head (< >) orientation. However, it 

is not possible to have head-to-head orientation when 3 or more genes are cloned 
in tandem. Also, the integration of a transgene into plants is random and 
adjacent genes on the chromosome can cause interference. It was reported in 
mammalian cells that poly(A) signal and a downstream transcriptional pause 
site are required for transcriptional termination by RNA polymerase II. Placing 
5 a poly(A) signal, a transcription pause site or both between tandem genes 
blocked the transcriptional interference (5). On this basis, an additional 
nucleotide sequence was placed after the upstream gene terminator to examine 
the effect on transcriptional interference on a downstream gene expression. A 
mammalian transcription blocker (TB) sequence was cloned between the GFP 
0 and LUC genes in head-to-tail (GFP^(TB)->LUC) and tail-to-tail 

(LUC->(TB)<-GFP) orientations (Figure 1C). The TB sequence (SEQ ID NO: 1) 
is a 154 bp DNA fragment containing a synthetic poly(A) site (nucleotides 1-49 of 
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SEQ ID NO: 1) and a transcription pause site from human alpha 2 globin gene 
(5; nucleotides 63-154 of SEQ ID NO: 1). As shown in Figure 2, transcriptional 
interference was completely blocked by the TB sequence in the construct 
LUC->(TB)<-GFP and partially blocked in the construct GFP -^(TB)-^LUC. As 
5 expected, cloning the TB sequence before the LUC gene in the construct 
(TB)LUC-» -»GFP and between the LUC and GFP genes in the construct 
LUC<— (TB)-»GFP did not increase the LUC activity as there was no interference 
in these orientations (Figure 2). 

The effect of other DNA fragments to eliminate or reduce the 
10 transcriptional interference in the construct GFP-> — >LUC was also examined. 
Three different DNA fragments from X phage [702 bp BstE II (SEQ ID NO: 2), 
1519 bp Stu I (SEQ ID NO: 3), or 2322 bp Hind III (SEQ ID NO: 4) fragments] 
Q were tested by cloning them between the GFP and LUC genes. As shown in 

Figure 4, the 702 bp and 1519 bp fragments partially negated the interference. 
15 The LUC/RLUC ratio for the construct GFP-* ->LUC increased from 1.41 to 4.42 
when a 702 bp X DNA fragment was inserted (GFP-K702)->LUC) and to 4.88 
when a 1519 bp fragment was inserted (GFP->(1519)-»LUC). Insertion of a 
2322 bp X DNA fragment between GFP and LUC genes (GFP-H2322)->LUC) not 
only eliminated the interference but also increased the LUC expression when 

2 0 compared to the construct LUC-> (Figure 4). The 702 bp and 1519 bp X DNA 
fragments have 46% and 48% AT content, respectively. In contrast, the 2322 bp 
fragment has 63% AT content. AT rich sequences can act as transcription 
termination signals and the elimination of interference by the 2322 bp fragment 
may be due to its higher AT content and/or longer length. 

25 The experiments described herein show that the transcriptional 

interference between adjacent genes can be significant and that this interference 

can be eliminated by cloning two genes in head-to-head (< >) orientation. 

When this orientation is not possible, particularly when three or more genes or 
gene expression cassettes are cloned, transcriptional interference can be reduced 

3 0 or eliminated by placing a spacer polynucleotide, such as a TB polynucleotide or 
a lambda phage polynucleotide, between the genes or cassettes. 
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EXAMPLE 2 



This Example describes an ecdysone receptor-based chemical-inducible 
system for plant gene regulation. Applicant has developed an inducible gene 
5 expression system with potential for field application using the spruce budworm 
Choristeneuria fumiferana (Cf) ecdysone receptor (EcR) and non-steroidal 
ecdysone agonists. Chimeric transcription factors were made using different 
DNA binding and activation domains and an EcR ligand binding domain. A 
reporter gene luciferase was cloned downstream of a DNA element where this 

1 0 chimeric transcription factor can bind. This chimeric transcription factor does 
not activate transcription in the absence of a ligand. Addition of the ligand 
methoxyfenozide, which has exceptional health and environmental safety 
profiles, induced luciferase expression. Applicant has used this chemical- 
inducible system in transient assays with tobacco protoplasts and in transgenic 

15 Arabidopsis and tobacco plants. Applicant's results show that the system based 
on the EcR ligand binding domain is very effective and can be used to express 
high levels of protein. 

Introduction: An inducible system to activate or inactivate plant gene 
2 0 expression has many potential applications in basic understanding of gene 
function, in manipulating complex developmental pathways, and in plant 
biotechnology. Many chemical-inducible systems have been developed for plants 
(18-20). Systems based on plant promoters whose expression is triggered by 
chemicals (e.g., salicylic acid) may not be suitable for field application because 
2 5 endogenous chemically inducible promoters may have high basal expression and 
may respond to environmental and plant signals in addition to applied chemical. 
Systems based on artificial chemical-induction use components derived from 
non-plant sources and have many advantages. 

The chemical-inducible gene regulation systems comprise two 
30 transcription units. The product of the first transcription unit is a activated 
transcription that is responsive to chemicals. The second transcription unit is 
under the control of transcription factor and consists of responsive element and 
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minimal promoter where transcription factor can bind. The chemical-inducible 
systems developed for plants include Tet repressor-based, tetracycline de- 
repressible (21), tTA-based, tetracycline inactivatable (22), glucocorticoid 
receptor-based, dexamethasone inducible (1), AlcR-based, ethanol inducible (23), 
5 ecdysone receptor (EcR)-based, ecdysone inducible (24), estrogen receptor-based, 
(3-estradiol inducible (25), and tetracycline and glucocorticoid receptors-based, 
dual control (26). An ideal gene expression system will have the following 
desirable properties (20): low basal expression levels, high inducibility, 
specificity to inducer, high dynamic range to inducer concentrations, fast 
10 response, switch-off after removal of inducer, and safe inducer. 

EcR-Based Plant Gene Regulation System: Growth, molting, and 
development in insects are regulated by the ecdysone steroid hormone (molting 
hormone) and the juvenile hormones (4). The molecular target for ecdysone in 
insects comprises at least an ecdysone receptor (EcR) and an ultraspiracle 
protein (USP). EcR is a member of the nuclear steroid receptor super family that 
is characterized by signature DNA and ligand binding domains, and an 
activation domain (27). Tebufenozide and methoxyfenozide are non-steroidal 
ecdysone analogs that are marketed worldwide by Rohm and Haas Company as 
insecticides. Both analogs have exceptional safety profiles to other organisms. 

Applicant has developed an inducible gene regulation system suitable for 
large-scale field application using EcR from spruce budworm and non-steroidal 
ecdysone agonists (see Figure 5). The first version of the system comprises a 
chimeric transcription activator (GVE, GAL4 DNA binding domain, VP 16 
activation domain, and EcR ligand binding domain) under the control of cassava 
vein mosaic virus (CsVMV) promoter. The reporter gene luciferase (Luc) is 
downstream of 5xGAL4 response element and minimal 35S promoter. 

Results: 

All binary vector constructs have four genes; NOSp-NPTII selectable 
marker, 35Sp-RLuc constitutive internal control marker, CsVMVp-GVE receptor, 
and 5xGAL+M35Sp-Luc reporter. The receptor and reporter genes were cloned 
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in all possible orientations; tail-to-tail (GVE-> <-Luc), head-to-tail (GVE-> -»Luc 

and Luc— >• — »GVE), and head-to-head (Luc< >GVE). Arabidopsis and tobacco 

plants were transformed by employing standard protocols. Leaf disks taken from 
TO plants were incubated in the presence or absence of 10 mM methoxyfenozide 
5 for 24 hours and induced Luc and constitutive Rluc activities were assayed using 
Dual Luciferase Assay Kit from Promega. The Luc activity is presented as the 
ratio between Luc and Rluc. Tl seed was germinated on agar media containing 
no or 10 mM methoxyfenozide and assayed for Luc and Rluc activity 3 weeks 
after germination. Applicant's results are discussed below and are presented in 
10 Figures 6-13 and in Tables 3-5. 

<-NOS-NPTII + <-35S-RLuc + CsVMV-GVE-> + <-5xGAL+M35S-Luc 
Construct 

In this construct, the receptor and reporter are in tail-to-tail orientation. 
Results from Arabidopsis TO plants are shown in Figure 6 and results from 
tobacco TO plants are shown in Figure 7. 

The uninduced Luc activity in Arabidopsis plants (Figure 6) ranged from 
0.09-89 relative light units (RLU) per mg protein while the induced activities 
ranged from 2-3558 RLUs/mg. Induced Luc activity in some plants reached to 
the similar levels as in 35S-Luc plants. Non-transgenic, negative control plants 
bad Luc activities of 0.04-1.02 RLU/mg. Fold-induction varied from 1-6963 and 
will be higher if Luc background activity value from non-transgenic plants is 
subtracted. Constitutive RLuc activity, which ranged from 32-2762, was used as 
an internal control and Luc activity is shown as Luc/RLuc ratio. 

<-NOS-NPTII + <-35S-Rluc + CsVMV-GVE-> + ->5xGAL+M35S-Luc 
Construct 

The receptor and reporter are in head-to-tail orientation. Data generated 
from analyses of this construct in Arabidopsis TO and Tl plants and Tobacco TO 
plants are shown in Figures 8-10. 
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<-NOS-NPTII + <-35S-Rluc + 5xGAL+M35S-Luc<- + ->CsVMV-GVE 
Construct 

The reporter and receptor are in head-to-head orientation. Data generated 
from analyses of this construct in Arabidopsis TO and Tl plants and Tobacco TO 
5 plants are shown in Figures 11-13. 



<-NOS-NPTII + <-35S-Rluc + 5xGAL+M35S-Luc^ + ->CsVMV-GVE 
Construct 

The reporter and receptor are in head-to-tail orientation. Data generated 
from analyses of this construct in Tobacco TO and Tl plants are summarized in 
Table 3. 



M Table 3. Summary of Reporter/Receptor Constructs in Tobacco TO and Tl Plants 



Construct 


Plants (n) 


>3F-I* 


Range (Mean) 
LUC, Uninduced 


Range (Mean) 
LUC. Induced 


GVE->->LUC 


21 


TO: 


6 


0.3-37 (7) 


3-335 (133) 






Tl: 


5 


0.1-7 (3) 


43-1167 (641) 


GVE->«-LUC 


15 


TO: 


9 


0.1-2 (1) 


9-220 (67) 






Tl: 


8 


0.3-10 (4) 


6-633 (204) 


LUC< — >GVE 


20 


TO: 


12 


7-107 (23) 


141-299 (234) 






Tl: 


11 


18-210 (71) 


97-1126 (729) 


LUC->^GVE 


25 


TO: 


14 


0.4-47 (12) 


2-530 (258) 






Tl: 


16 


0.4-121 (51) 


56-2433 (1472) 


VGE^^LUC 


19 


TO: 


5 


0.2-1 (0.4) 


4-59 (32) 






Tl: 


7 


1.8-13 (7) 


157-489 (295) 


VGE^<-LUC 


26 


TO: 


10 


0.1-8 (2) 


4-287 (142) 






Tl: 


13 


0.1-3 (1) 


3-940 (330) 


LUC< — >VGE 


21 


TO: 


10 


0.6-29 (7) 


14-286 (121) 






Tl: 


9 


3-102 (38) 


102-1738 (832) 



F-I* refers to "fold-induction" 



Applicants also tested the effect of the order or arrangement of GAL4, 
VP 16 and EcR domains within the receptor construct on the uninduced and 
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induced Luc expression levels (see Tables 3-5). Tobacco (BY2) protoplasts were 
transfected with reporter plasmid and three different receptor plasmids in which 
the location of VP 16 (with respect to other domains) was varied. Protoplasts 
were incubated in the presence (induced) or absence (uninduced, UI) of 10 mM 
methoxyfenozide and Luc activity was assayed 17 hours of incubation (see Tables 
4 and 5). Higher fold induction was observed for VGE because of reduction in 
basal activity. 



Table 4. Protoplast assays with different receptor constructs and reporter on the 
10 same plasmid. 



15 



20 



25 



30 



GVE 2170 


29173 


14 


VGE 516 


21063 


43 


GEV 363 


5802 


16 


Table 5. Protoplast assavs with receptor and reporter on tbe same p 


Construct 


Luc (RLU) 


Fold-Induction 


GVE^ + ->Luc 






UI 


2,826 




Induced 


17,334 


7.63 


GVE-» + <-Luc 






UI 


1,786 




Induced 


8,970 


5.62 


Lue-> + ->GVE 






UI 


5,941 




Induced 


29,146 


5.87 


Luc<- + ->GVE 






UI 


10,704 




Induced 


48,365 


4.90 



3 5 Conclusions: Arabidopsis and tobacco plants with low uninduced and high 
induced levels of luciferase were obtained by optimizing receptor (GVE) and 
reporter gene orientations. The uninduced luciferase activity in some plants was 
very low (3-5 relative light units compared to 1-3 RLU for control plants). The 
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induced levels in some plants reached the levels of the constitutive 35S-Luc 
control (data not shown). The fold-induction was increased as high as 27,000 
fold. 

The present invention is not to be limited in scope by the specific 
5 embodiments described herein. Indeed, various modifications of the invention in 
addition to those described herein will become apparent to those skilled in the 
art from the foregoing description and the accompanying figures. Such 
modifications are intended to fall within the scope of the appended claims. It is 
further to be understood that all base sizes or amino acid sizes, and all molecular 
1° weight or molecular mass values, given for nucleic acids or polypeptides are 
approximate, and are provided for description. 

REFERENCES 

1. Aoyama, T. and N.-H. Chua. 1997. Plant J. 11: 605-612. 
15 2. Bhattacharyya, M.K., B.A. Stermer and R.A. Dixon. 1994. Plant J. 6: 957-968. 

3. Breyne, P., G. Gheysen, A. Jacobs, M. V. Montagu and A. Depicker. 1992. 
Mol. Gen. Genet. 235: 389-396. 

4. Dhadialla, T.S., G.R. Carlson andD.P.Le. 1998. Annu. Rev. Entomol. 43: 
545-569. 

205. Eggermont, J. and N.J. Proudfoot. 1993. EMBO J. 12: 2539-2548. 

6. Greger, I.H., A. Aranda, and N.J. Proudfoot. 2000. Proc. Natl. Acad. Sci. USA 
97: 8415-8420. 

7. Grierson, D., R.G. Fray, A.J. Hamilton, C.J.S. Smith, and C.F. Watson. 1991. 
Trends Biotechnol. 9: 122-123. 

25 8. Ingelbrecht, L, P. Breyne, K. Vancompernolle, A. Jacobs, M. V. Montagu and 
A. Depicker. 1991. Gene 109: 239-242. 

9. Kikkawa, H., T. Nagata, C. Matsui and I. Takebe. 1982. J. Gen. Virol. 63: 
457-467. 

10. Mol, J., R. van Blokland and J. Looter. 1991. Trends Biotechnol. 9: 182-183. 
30 11. Paszty, C.J.R. and P. F. Lurquin. 1990. Plant Sci. 72: 69-79. 

12. Peach, C. and J. Velten. 1991. Plant Mol. Biol. 17: 49-60. 

13. Proudfoot, N.J. 1986. Nature 322: 562-565. 



-49- 



14. Reichel, C, J. Mathur, P. Eckes, K. Langenkemper, C. Koncz, J. Schell, B. 
Reiss and C. Maas. 1996. Proc. Natl. Acad. Sci. USA 93: 5888-5893. 

15. Sambrook, J., E.F. Fritsch and T. Maniatis. 1989. Molecular cloning: A 
laboratory manual (2 nd Edition). Cold Spring Harbor Laboratory Press. 

5 16. Thompson, A.J. and S.C. Myatt. 1997. Plant Mol. Biol. 34: 687-692. 

17. Verdaguer, B., K. Alexandre, R. N. Beachy and C. Fauquet. 1996. Plant 
Mol. Biol. 31: 1129-1139. 

18. Gatz, C. 1997. Annu. Rev. Plant Physiol. Plant Mol. Biol., 48:89-108. 

19. Gatz, C. & Lenk, I. 1998. Trends Plant Sci., 3:352-358. 

10 20. Zuo, J. & Chua, N-H. 2000. Current Opinion Biotech., 11:146-151. 

21. Gatz, C. et al. 1992. Plant J., 2:397-404. 

22. Weinmann, P. et al. 1994. Plant J., 5:559-569. 

23. Caddick, M.X. et al. 1998. Nat. Biotechnol., 16:177-180. 

24. Martinez, A. et al. 1999. Plant J., 19:97-106. 
15 25. Bruce, W. et al. 2000. Plant Cell, 12:65-80. 

26. Bohner, S. et al. 1999. Plant J., 19:87-95. 

27. Mangelsdorf, D.J. et al. 1995. Cell, 83:835-839. 



fll 



